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Abstract 

Introduction: Given the role of estrogen in breast carcinogenesis and the modification of estrogen receptor (ER) 
activity by its biochemical cofactors, we hypothesize that genetic variation within ER cofactor genes alters cellular 
response to estrogen exposure and consequently modifies the risk for ER-positive breast cancer. 

Methods: We genotyped 790 tagging SNPs within 60 ER cofactor genes in 1,257 cases and 1,464 controls from 
Sweden and in 2,215 cases and 1,265 controls from Finland, and tested their associations with either ER-positive or 
ER-negative breast cancer. 

Results: Seven SNPs showed consistent association with ER-positive breast cancer in the two independent 
samples, and six of them were located within PPARGCIB, encoding an ER co-activator, with the strongest 
association at rs741581 (odds ratio = 1.41, P = 4.84 x 10"^) that survived Bonferroni correction for multiple testing 
in the combined ER-positive breast cancer sample (Pcorrected = 0.03). Moreover, we also observed significant 
synergistic interaction {Pinteraaion = 0.008) between the genetic polymorphisms within PPARGCIB and ESRl in ER- 
positive breast cancer. By contrast, no consistent association was observed in ER-negative breast cancer. 
Furthermore, we found that administration of estrogen in the MCF-7 cell line induced PPARGCIB expression and 
enhanced occupancies of ER and RNA polymerase II within the region of SNP association, suggesting the 
upregulation of PPARGCIB expression by ESRl activation. 

Conclusions: Our study revealed that DNA polymorphisms of PPARGCIB, coding a bona fide ER co-activator, are 
associated with ER-positive breast cancer risk. The feed-forward transcriptional regulatory loop between PPARGCIB 
and ESRl further augments their protein interaction, which provides a plausible mechanistic explanation for the 
synergistic genetic interaction between PPARGCIB and ESRl in ER-positive breast cancer. Our study also highlights 
that biochemically and genomically informed candidate gene studies can enhance the discovery of interactive 
disease susceptibility genes. 



Introduction 

It is known that the risk of breast cancer is related to 
Ufetime exposure to estrogen [1,2]. Estrogen stimulates 
cell proliferation and increases the frequency of sponta- 
neous mutations, leading to a malignant phenotype [3]. 
Breast cells respond to estrogen via estrogen receptors 
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(ERs) through a defined biochemical process: upon 
ligand binding, ERs undergo a conformational change 
that facilitates receptor dimerization, DNA binding, 
recruitment of ER cofactors, and modulation of target 
gene expression [4-6]. 

Endocrine therapy provides strong evidence that 
attenuation of ER (ESRl) activity can reduce breast can- 
cer risk [7], and women with ER-positive tumor would 
be the most likely to benefit from these treatments [7,8]. 
The genetic studies of ESRl, however, have had 
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contradictory results. Only recently, through a very large 
genetic association study, has there been demonstrated a 
small but significant association of polymorphisms 
within ESRl with the risk of breast cancer [9-11]. Two 
plausible explanations for the inconsistent results might 
be due to the small sample sizes and thus limited statis- 
tical power of these studies, or that the risk was not 
evaluated by stratifying breast cancer patients based on 
tumor ER status. However, there is at least one further 
possibility: ER cofactors can either enhance transcrip- 
tional activity of ER as co-activators or inhibit the activ- 
ity as co-repressors. The genetic variants within ER 
cofactors have not been systematically investigated in 
term of association with breast cancer risk, although 
some coding variants within individual genes, such as 
NC0A3 and CCNDl, have been investigated [12-15]. 

Given the modification of ER activity by its cofactors 
through their physical and functional interactions [16], 
the cofactor proteins that bind to ER may be as impor- 
tant as the receptor itself in mediating transcriptional 
response to estrogen exposure [17]. We therefore 
hypothesized that genetic variation within ER cofactor 
genes may alter cellular response to estrogen exposure 
and consequently, alone or by interacting with genetic 
variations within ESRl, modify breast cancer risk in an 
ER status-dependent fashion. To assess this hypothesis, 
we investigated the association of common genetic var- 
iation, using a tagging SNP approach, within 60 cofactor 
genes in two large case-control samples of breast cancer 
from Sweden and Finland, and investigated their inter- 
action with genetic variation within ESRl in terms of 
influencing the risk of hormone-driven breast cancer. 

Materials and methods 

Study population 

The Swedish sample was from a population-based case- 
control study that has been described in detail pre- 
viously [18]. Briefly, 1,322 cases were Swedish-born 
women diagnosed with incident primary invasive breast 
cancer between October 1993 and March 1995 who 
contributed blood samples. All cases were postmeno- 
pausal and between 50 and 74 years of age at diagnosis. 
All the cases were identified through the six regional 
cancer registries in Sweden. The controls (« = 1,524) 
were randomly selected from the Swedish Registry of 
Total Population with no previous breast cancer and 
were frequency-matched for age with the cases. Ques- 
tionnaires were used to collect risk factor information. 

The Finnish sample was from a hospital-based case- 
control study in which the cases consisted of two series 
of unselected breast cancer patients and additional 
familial patients diagnosed at the Helsinki University 
Central Hospital. The first set of cases were 884 patients 
collected in 1997/1998 and 2000, covering 79% of all 



newly diagnosed breast cancer cases during those peri- 
ods [19,20]. The second set of cases, consisting of 986 
newly diagnosed breast cancer patients, were collected 
during 2001 to 2004 and covered 87% of all such 
patients during that period [21]. An additional 538 
familial breast cancer cases were also collected at the 
same hospital, as previously described [22,23]. Women 
with a prior diagnosis of breast cancer in situ were 
excluded, leaving 2,215 invasive breast cancer cases for 
analysis. Healthy female population controls (« = 1,287) 
were collected from the same geographical regions of 
Finland as the cases. 

Information on reproductive and hormonal risk fac- 
tors was available for the Swedish sample and showed 
expected association patterns with breast cancer [24-26]. 
Such information was not available for the Finnish 
controls. 

Hormone receptor status information was retrieved 
from medical records of all participating cases and was 
available for both the Swedish and Finnish cases. 

Approval for the study was obtained from the Institu- 
tional Review Boards in Sweden, Finland and the 
National University of Singapore. All subjects provided 
written informed consent. 

DNA isolation 

DNA was extracted from 4 ml whole blood using the 
QIAamp DNA Blood Maxi Kit (Qiagen, Hflden, Ger- 
many) according to the manufacturer's instructions. 

Candidate gene and tagging SNP selection 

In the present study, the keywords 'ER cofactor', 'ER 
coactivator' and 'ER corepressor' were used in a litera- 
ture search to identify ER cofactor genes. Boolean 
searching ('AND' 'OR') was used to narrow or broaden 
the search in PubMed. Using this method, 60 ER cofac- 
tor genes were identified as candidate genes. Tagging 
SNPs within the 60 candidate genes were selected based 
on the HapMap CEU data (Rel #22/phase II Apr07, on 
NCBI B36 assembly, dbSNP bl26) [27]. In brief, for 
each gene, all common SNPs with a minor allele fre- 
quency >0.05 within the gene and 5 kb surrounding 
region were first identified from the HapMap database 
[28]. Tagging SNPs were then selected in Haploview 
version 4.1 [29] using a pair- wise SNP tagging approach 
with > 0.8 used as the criterion for selection. A total 
of 806 tagging SNPs were selected within the 60 ER co- 
factor genes. 

Genotyping 

Illumina's GoldenGate assay was used for genotyping 
SNPs, following the manufacturers' instructions (Illu- 
mina, San Diego, CA, USA). In brief, all 806 tagging 
SNPs were subjected to genotyping assay design, out of 
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which 790 SNPs were successfully designed and sub- 
jected to genotyping analysis. DNA samples were ran- 
domly assigned to the plates carrying positive and 
negative controls, and all genotyping results were gener- 
ated and checked by laboratory staff unaware of the 
case-control status. SNPs with a call rate <96% (81 
SNPs failed in the Swedish sample and 42 SNPs failed 
in the Finnish sample) and minor allele frequency <1% 
(18 SNPs in the Swedish sample and 40 SNPs in the 
Finnish sample) were excluded from further analysis. 
Deviation of genotype frequencies from those expected 
under Hardy- Weinberg Equilibrium were assessed in the 
control subjects. SNPs with Hardy-Weinberg Equili- 
brium P < 7.4 X lO''^ (0.05/675) were excluded (6 SNPs 
failed in the Swedish sample and 15 SNPs failed in the 
Finnish sample). In total, 685 SNPs from the Swedish 
sample and 693 SNPs from the Finnish sample were 
used for statistical analysis, and 675 shared SNPs 
between the Swedish and Finnish samples were used for 
analysis in the combined sample. 

Genotyping was duplicated in 2% of samples (in both 
Swedish and Finnish samples) and there was concor- 
dance in >99% of the duplicated samples, suggesting 
high genotyping accuracy. With > 0.8, the average 
coverage of common variation (minor allele frequency 
>5%) within the 60 candidate genes was 91%. Out of 
these, 51 genes had coverage over 80% (Additional file 1 
Table SI). 

Reverse transcriptase-quantitative PCR analysis 

MCF-7 cells were cultured in DMEM (Invitrogen, Carls- 
bad, CA, USA) medium with 10% FBS (Invitrogen). 
Prior to hormone treatment, cells were maintained in 
phenol-red free DMEM F-12 containing 5% charcoal 
stripped serum for 72 hours for hormone depletion. 
Cells were treated with 10 nM 17p -estradiol (Sigma- 
Aldrich, St. Louis, MO, USA) for a period of 0 or 3 
hours. Cells were harvested and total RNA and reverse 
transcriptase-quantitative PCR analysis was carried out 
as described previously [30]. Dimethylsulfoxide (Sigma- 
Aldrich, St. Louis, MO, USA)/vehicle-treated cells were 
used as controls for the same time course. Real-time 
PCR analysis was performed in the ABI Prism 7700 
sequence detection system using SYBR Green from ABI 
(Applied Biosystems, Foster City, CA,USA). 

Primers were designed using the online Primer 3 pro- 
gram [31]. All experiments were repeated at least twice. 
Two sets of primers were used for identifying different 
isoforms of PPARGCIB. The oligonucleotide sequences 
were as follows: PPARGC1B_1 isoform (NM_00117 
2699.1) forward 5'-GAAGAGGAAGAAGGGGAGGA-3' 
and reverse 5'-CTCTGGTAGGGGCAGTGGT-3'; and 
PPARGC1B_2 isoform (NM_133263.3) forward 5'- 
CCTGAAGATGACGTGGGTCT-3' and reverse 5'- 



CCTTCCTTCTGGGTGTCAGA-3'. P-Actin specific pri- 
mers (forward 5'-TCCCTGGAGAAGAGCTACGA-3' 
and reverse 5'-AGGAAGGAAGGCTGGAAGAG-3') 
were used as an internal control to normalize the 
amounts of reverse transcribed product used in the PCR 
reaction. Threshold cycle (Ct) values obtained for 
PPARGCIB isoforms were normalized to P-actin Ct 
values. The normalized Ct (ACt) values were then used 
to calculate the difference (AACt) between estradiol- 
treated and dimethylsulfoxide-treated samples. The fold 
change of PPARGCIB was calculated as 2"^'^". 

Statistical analysis 

To measure the magnitude of association between SNPs 
and breast cancer risk, per-allele odds ratios (ORs) 
(assuming a log-additive model) and 95% confidence 
intervals were estimated using logistic regression. As the 
controls were younger than cases in the Finnish sam- 
ples, age at diagnosis/enrollment (as a continuous vari- 
able) was included in the regression models in the 
Finnish analysis for OR adjustment. The Cochran-Armi- 
tage trend test was used to calculate P values in the 
Swedish and Finnish sample sets, separately in subtypes, 
and in cases overall. Inverse variance weighting was 
used in a meta-analysis for two independent datasets. 
The individual OR was obtained from age-unadjusted 
analysis in the Swedish sample and age-adjusted analysis 
in the Finnish sample. To evaluate differences in ORs 
between studies, a test of homogeneity was carried out 
for each individual SNP analysis (data not shown). 

To determine the model of inheritance, associations 
between SNPs within the PPARGCIB gene and ER-posi- 
tive breast cancer risk were estimated by assuming 
dominant, recessive and additive models in the two 
sample sets. We then performed these analyses with 
meta-analysis using inverse variance weighting approach. 
Individual ORs from two independent studies followed- 
up age-unadjusted analysis in the Swedish sample and 
age-adjusted analysis in the Finnish sample. 

Forward stepwise logistic regression was used to 
explore whether the associations at the six SNPs were 
independent of each other. The selection criterion was 
P < 0.2. The analysis was performed in ER-positive 
breast cancer risk in the two sample sets separately as 
well as in the combined ER-positive sample dataset. To 
account for different minor allele frequencies in the two 
populations, a binary indicator variable for study was 
included in the regression models as well as age in the 
combined data regression analysis. 

Pair-wise interaction analysis was performed under a 
dominant mode of inheritance using logistic regression 
and likelihood ratio tests. To maximize the statistical 
power, we pooled sample sets from the Swedish and 
Finnish data. Age and study were included in the model 
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as covariables. The full model included an interaction 
term between the two interacting variables for the risk 
of breast cancer. In this multivariate logistic regression 
analysis, each coefficient provided an estimate of the log 
OR whilst adjusting for all other variables included in 
the model. Likelihood ratio tests, comparing models 
with and without the interaction term, were used to 
generate P values. 

All analyses were performed using STATA version 8.0 
(StataCorp, College station, TX, USA). Linkage disequili- 
brium (LD) calculation was performed in Haploview 
version 4.1 [29]. All statistical tests were two-sided. 

Results 

Study subjects 

Two independent case-control samples of breast cancer 
from Sweden and Finland were investigated in the present 
study, whose characteristics are summarized in Table 1. 
The cases and controls of the Swedish sample were fre- 
quency-matched on age, whereas the Finnish controls 
were younger than the Finnish cases {P < 0.0001). In the 
Swedish sample, there were significant differences between 
the cases and controls in terms of age at first birth {P = 
0.0002), age at menopause (P = 0.0001), hormone replace- 
ment treatment use (P = 0.017), and parity {P = 0.0001), 
which is consistent with the well-established role of these 
reproductive factors in breast cancer development. The 
reproductive factor information was not available for the 
Finnish controls. In both the Swedish and Finnish cases, 
there were similar percentages of ER-positive (81.9% vs. 
80.9%) and ER-negative (18.1% vs. 19.1%) cases. 

SNP association analysis 

First, single SNP association analyses were performed 
using trend tests in the Swedish and Finnish samples 
separately by stratifying the cases into ER-positive and 
ER-negative groups, with 685 SNPs being tested in the 



Swedish sample and 693 SNPs being tested in the Fin- 
nish sample. 48 SNPs (7.00%) in the Swedish sample 
and 50 SNPs (7.28%) in the Finnish sample showed 
association with ER-positive breast cancer risk with 
nominal P < 0.05. Seven SNPs showed consistent asso- 
ciation between the two independent samples (Addi- 
tional file 1 Table S2), and six of them were located 
within the PPARGCIB gene. In contrast, 21 and 50 
SNPs showed association with ER-negative breast cancer 
with nominal P < 0.05 in the Swedish and Finnish sam- 
ples, respectively, but no SNPs showed consistent asso- 
ciations between the two independent samples. 

We then analyzed SNP associations in the combined 
Swedish and Finnish samples. In general, SNPs showed 
stronger evidence of association with ER-positive breast 
cancer than ER-negative breast cancer (Table 2; see also 
Additional file 1 Table S3). The most significant associa- 
tion was identified at rs741581 within the second intron 
of PPARGCIB (OR = 1.41, P = 4.84 x lO '^) in ER-posi- 
tive breast cancer, which survived the Bonferroni correc- 
tion for multiple testing (-Pcorrected = 0.03). rs741581 was 
one of the seven SNPs that showed consistent associa- 
tions between the Swedish and Finnish samples. 

We also evaluated the SNP association with overall 
breast cancer risk and found 55 SNPs (8.03%) from the 
Swedish samples and 61 SNPs (8.80%) from the Finnish 
samples to show association with overall breast cancer risk 
with nominal P < 0.05. Only two SNPs, however, showed 
consistent association between the two independent sam- 
ples (Additional file 1 Table S2), and none of the associa- 
tions survived Bonferroni correction for multiple testing in 
the combined samples (smallest ^corrected = 0.198). 

Genotype association analysis of PPARGCIB in ER-positive 
breast cancer 

PPARGCIB is located on 5q33.1 and encodes for peroxi- 
some proliferative activated receptor gamma coactivator 



Table 1 Selected characteristics of cases and controls in the Swedish and the Finnish samples 



Characteristic 



Swedish sample 



Finnish sample 



Number 



Mean 



P value' 



Number 



Mean 



P value' 



Entire study 
Age (years) 
Age at first birth 
Age at menopause 
BMI (recent) 

Case only 
All cases 
ER-positive 
ER-negative 
Controls 



1,257/1,464 
1,072/1,321 
1 ,247/1 ,460 
1 ,250/1 ,443 



Number 



1,257 
684 
151 
1,464 



63.11/63.12 
25.50/2476 
50.6/49.97 
25.71/25.67 



0.96 

2.00 X 10'^ 
1.00 X 10"^ 
0.81 



2,214/1,265 
1,185/- 
1,341/- 
1,525/- 



56.07/40.88 
26.43/- 
50.34/- 
25.03/- 



Percentage 



Number 



Percentage 



/ 

81.92 
18.08 
/ 



2,215 
1,709 
403 
1265 



/ 

80.92 
19.08 
/ 



< 1.00 X 10'" 

/ 

/ 

/ 



Data presented as cases/controls. BMI, body mass index; ER, estrogen receptor. ^Two-sided f test was used for P value estimation. 
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Table 2 Twenty-five most significant SNPs associated with ER-positive breast cancer in Swedish and Finnish samples 



Gene 



SNP 



Position 



P value^ 



Adjusted P value 



OR (95% C\f 



rr/MnuL / D 


rc741 "^Rl 
\ DO 1 


LI II 3. 1 '-fy 1 ozy / o 


A9.A ^ 


lU U.Ujj 


1 A^A M 1Q7 1 f\71\ 
\ .H \ H \ \ Az) / , 1 .0/ Zj 




1 i 1 U 1 


LI II 3. 1 '-fy 1 J / 1 JO 


Q QR V 

y.yo X 


1 n"^ n c\fd 

lU U.UD/ 


1 .ZZj I, I . I Uj, I .J J J^ 


PPAQrn R 




Ll 11 J. 1 ^y 1 ZU4j J 


7 7'5 v 
Z./j X 


1 (Y^ n 1 Q/4 
1 U U.I o'4- 


1 77f; (^ noQ 1 ■5f^/^^ 

1 .ZZj 1, 1 .Uyo, 1 .jDD^ 


L/An/W / 


1 S 1 dZ^/ I 1 


Ll 1 1 1 y . 1 Uoo44 j'^- 


H.ZO X 


1 cy^ n TQR 

1 U U.Zoo 


1 .zzy \ 1 .uyo, i .j / o^ 




rs7469 


rhrl A01AAf]f]'^7 
LIU 1 'H-.ZZ'H-'H-UU J / 


1 1 5 X 


-|Q"3 0 778 


1 .Z^O V 1 -Uyz, 1 r\Z/ I 


ppARf^n R 




Ll II J. 1 ^yuyj 1 


1 7P V 
1 ./ O X 


1 u 


1 . 1 yj \ 1 .uoo, 1 .jjj^ 


Kir DP 7 


\b \ Uo^OO/ u 


Ll II 1 Z. 1 Zj^ju I OH 


1 f^7 V 
Z.D/ X 


1 u 


u.o/z ^u./yo, u.yj^^ 




\bjo \ \ \ O/ 


LI II 1 ^.ZZ^ jy 1 j^ 


z.yo X 


1 u 


1 . 1 JO \ 1 .Uj I , \ .Z/ Dj 


PFI PI 




rhrl 7-A^lQn71 

LI II \ 1 .'-tjzy / / z 


J.Z J X 


1 u 


1171 (^ C\'^A 1 'X\ 
1 . 1 / 1 \ \ .Uj-H, I .jj 


ppA RnnR 

rr /inui_ / D 


1 bZ J'-hUOZ 1 


rhr^^'l 4Q1 79^riQ 
Ll 1 1 J. 1 ^y 1 ZZ JKJy 


4 00 X 




1 . 1 ^y 1, 1 .U'H-j, 1 .zoz) 






rhrl 1 ■^^Q1 7^Q7A 

LI II 1 1 .oy 1 / jy / "4- 


'-f.^j X 


1 u 


n R77 /Tl PHI n QA^l 
u.o// i^u.ou 1 , u.yo^ 


ppARf^n R 


lb 1 UUjOD JO 


rhr^'1 4Q1 ^^7R1 

LI II J. 1 '-fy 1 J J / o 1 


J.J 1 X 


10-3 
1 u 


1 1 "^(^ n DA^ 1 9P'l 
1 . 1 JO \ 1 .U-Hj, I .Zo^ 


PFI PI 


1 b/ Z 1 'H-OjD 


rhrl 7A'^A77f\Q 

LI II 1 / .HDH / / oy 


J.D/ X 


10-3 

1 u 


1 . 1 DO \ 1 .U-HO, 1 .3) 


ppARf^n R 


1 b^/UD JOZ 


rhr^'1 AQ1 f^^ ^^Q 

LI II J. 1 'H-y 1 o 1 J jy 


O.jZ X 


10-3 
1 U 


nR79 /'n7Q1 n QA9'l 

u.o/z i,u./yi, u.yoz^ 




1 b 1 1 U/ 1 Zz^ 


Liii 1 J.J jyuzo 1 / 


D.jZ X 


1 u 


n 7f^/i (Tl n o97^ 
u./D^ i^u.oj, u.yz/^ 


MED13 


r54968469^ 


chrl 7:57491 867 


6.90 X 


10-^ 


1.145 (1.038, 1.263) 


NC0R2 


rsl 2321 007 


chrl 2:1 23449054 


7.44 X 


10-^ 


1.137 (1.035, 1.25) 


MED13 


rs9889324^ 


chrl 7:57481 404 


8.89 X 


10-^ 


1.14 (1.033, 1.258) 


NC0R2 


rsl 794973 


chrl 2:1 23391 545 


9.45 X 


10-^ 


0.889 (0.813, 0.972) 


PPARGCIB 


rsl 422429 


chr5:149146627 


1.11 X 


10-^ 


1.124 (1.027, 1.231) 


NC0R2 


rsl 0846666 


chrl 2:1 23450306 


1.17 X 


10-2 


0.867 (0.776, 0.969) 


NCOAl 


rsl 704651 3 


chr2:24817999 


1.26 X 


10-2 


1.284 (1.055, 1.563) 


NCOAl 


rsl 7046462 


chr2:24759054 


1.37 X 


10-2 


1 .285 (1 .053, 1 .568) 


NC0R2 


rsl 0846667 


chrl 2:1 23450377 


1.40 X 


10-2 


0.895 (0.819, 0.978) 


SNWl 


rs3759728^ 


chrl 4:7729991 2 


1.49 X 


10-2 


0.855 (0.753, 0.97) 



ER, estrogen receptor; chr, chromosome; OR, odds ratio; CI, confidence interval. value and OR were obtained from meta-analysis based on the inverse variance 
method for two independent datasets. The individual OR was obtained from age-unadjusted analysis in the Swedish sample and age-adjusted analysis in the 
Finnish sample. ''P value was adjusted by Bonferroni correction (n = 675); -, adjusted P > 1. '^SNP belongs to the top 25 most significant SNPs associated with ER- 
negative breast cancer. ^SNP was significantly associated with ER-positive breast cancer in both the Swedish and Finnish datasets. 



beta protein (PGC-ip), a bona fide co-activator of ERa. 
In the present study, 40 tagging SNPs within 
PPARGCIB were successfully genotyped in both the 
Swedish and Finnish samples, which could capture 80% 
of common variants (131 out of 162 SNPs) within 
PPARGCIB with a minimal r value of 0.8 (mean r 
value = 0.95, according to HapMap CEU data). 

To have a better understanding of the association 
within PPARGCIB, we performed genotype-based asso- 
ciation analysis by assuming dominant, recessive and 
additive model of inheritance. We found that the top 
three SNPs yielding the most significant association evi- 
dence in the dominant model compared with other 
models (Additional file 1 Table S5). Under the dominant 
model, the same six SNPs (as for the trend tests) of the 
40 SNPs within PPARGCIB showed consistent associa- 
tion with ER-positive breast cancer between the Swedish 
and Finnish samples (Table 3). The strength of the asso- 
ciation (ORs) at the six SNPs was stronger in ER-posi- 
tive breast cancer than in overall or ER-negative 
cancers, with the strongest association identified at 
rs741581 (P = 1.9 x 10'^ in the Swedish samples, P = 



6.1 X 10'^ in the Finnish samples, and P = 1.8 x 10'^ in 
the combined samples). 

The six SNPs showing consistent association with 
ER-positive breast cancer were located within two 
regions of high LD (Figure IB), suggesting that the 
associations at those SNPs may not be completely 
independent. We therefore performed a forward step- 
wise logistic regression (cut-off P = 0.20) and revealed 
two independent associations with ER-positive breast 
cancer at rs741581 (P = 0.031) and rs6895698 {P = 
0.014) in the combined sample. Under the dominant 
model, we found that rs741581, rs6895698, age and 
study sample were four independent variables asso- 
ciated with ER-positive breast cancer risk. Similarly, 
the same stepwise analysis of ER-positive breast cancer 
in the two individual samples also revealed two inde- 
pendent associations at rs741581 (P = 0.172) and 
rs2340621 (P = 0.036) in the Swedish sample and at 
rs741581 (P = 0.023) and rs6895698 (P = 0.053) in the 
Finnish sample. Notably, rs6895698 and rs2340621 lie 
within the same LD block and are highly correlated 
(r^ = 0.72, according to HapMap CEU data). 
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Table 3 Six overlapping SNPs in PPARGCIB associated with ER-positive breast cancer in Swedish and Finnish samples 


SNP 


Allele^ 


Subtype 




Swedish sample 




Finnish sample 


maf'' 


OR*' (95% CI) 


map'' 


OR'' (95% CI) 


rs4705365 


G/A 


ERh- 


0.21 


1.26 (1.05, 1.52) 


0.1 7 


1.26 (1.05, 1.52) 






ER- 




1.14 (0.81, 1.61) 




1.17 (0.9, 1.52) 






All cases 




1.14 (0.97, 1.33) 




1.18 (0.99, 1.4) 


rs6895698 


G/A 


ER-h 


0.22 


1.27 (1.06, 1.53) 


0.17 


1.39 (1.15, 1.67) 






ER- 




1.05 (0.74, 1.48) 




1.14 (0.88, 1.49) 






All cases 




1.12 (0.96, 1.3) 




1.25 (1.05, 1.49) 


rs2340621 


G/A 


ER-F 


0.31 


1.3 (1.08, 1.57) 


0.32 


1.22 (1.03, 1.46) 






ER- 




0.86 (0.61, 1.2) 




1.05 (0.82, 1.34) 






All cases 




1.12 (0.96, 1.31) 




1.14 (0.97, 1.34) 


rsl 0036538 


C/G 


ER-h 


0.26 


1.19 (0.99, 1.42) 


0.22 


1.2 (1, 1.43) 






ER- 




0.91 (0.65, 1.28) 




1.05 (0.81, 1.35) 






All cases 




1.03 (0.89, 1.2) 




1.11 (0.94, 1.31) 


rs1012543 


A/G 


ER-h 


0.26 


1.26 (1.05, 1.51) 


0.23 


1 .26 (1 .06, 1 .5) 






ER- 




1.08 (0.77, 1.51) 




1 .08 (0.84, 1 .38) 






All cases 




1.11 (0.95, 1.29) 




1.18 (1, 1.39) 


rs741581 


G/A 


ER-E 


0.08 


1.32 (1.05, 1.67) 


0.05 


1.76 (1.33, 2.31) 






ER- 




0.81 (0.49, 1.32) 




1.21 (0.81, 1.82) 






All cases 




1.12 (0.92, 1.37) 




1.53 (1.18, 1.98) 



ER+, estrogen receptor-positive; ER-, estrogen receptor-negative; MAP, minor allele frequency; OR, odds ratio; CI, confidence interval. ^Major allele/minor allele. 
'^From control samples only. ^'ORs were performed on a dominant model, age-unadjusted analysis in the Sv\/edish sample and age-adjusted analysis in the Finnish 
sample. 



Association of ESR1 variation with ER-positive breast 
cancer 

Our previous study suggested an association between 
ESRl polymorphisms and breast cancer risk [9,32] in 
the same Swedish sample. The association was within a 
region flanked by rs988328 to rs3020318 and was mani- 
fested by three haplotypes. Using the haplotype informa- 
tion from the HapMap CEU data, we identified three 
common SNPs that were in high LD (r^ = 0.89) with 
one of the three haplotypes (TAG18~21) [9], while no 
SNPs were found with > 0.5 for the other two haplo- 
types, based on the HapMap CEU data (Rel #22/phase 
II Apr07, on NCBI B36 assembly, dbSNP bl26). Given 
that the three SNPs were in perfect LD (r^ = 1), we 
genotyped one of the three SNPs, rs7761846, in our 
Swedish and Finnish samples. Given that a large associa- 
tion study of ESRl by the Breast Cancer Association 
Consortium also revealed a significant association within 
the same region under a dominant model [10], we 
searched for SNPs that were in high LD (r^ > 0.5) with 
the three haplotypes but were not genotyped in our pre- 
vious study. Then we performed a genotype-based asso- 
ciation analysis under a dominant model of inheritance. 
As expected, rs7761846 showed association with ER- 
positive breast cancer (OR = 1.28, P = 0.014) in the 
combined sample. The two independent Swedish and 
Finnish samples also revealed consistent association, 
although the association in the Finnish sample did not 
reach statistical significance (Table 4). 



Genetic interaction between the polymorphisms of 
PPARGCIB (rs6895698, rs2340621 and rs741581) and ESRl 
(rs7761846) 

Given the known modification of ER activity by 
PPARGCIB in cellular response to estrogen exposure, we 
investigated the genetic interaction between rs741581, 
rs2340621 and rs6895698 within PPARGCIB and 
rs7761846 within ESRl in terms of modulating ER-posi- 
tive breast cancer risk. The analysis in the combined 
sample identified a significant synergistic interaction 
between rs2340621 (representing PPARGCIB) and 
rs7761846 (representing ESRl) (^interaction = 0.008) 
(Table 5). Women carrying both PPARGCIB (rs2340621) 
and ESRl (rs7761846) risk genotypes {GA/AA and CT/ 
CC) had a much higher risk for breast cancer than non- 
carriers (GG and TT) (OR = 1.94, P = 2.03 x 10'^). Simi- 
lar patterns of genetic interaction were also observed 
between the remaining SNPs rs741581 (PPARGCIB) and 
rs7761846 (ESRl) as well as rs6895698 (PPARGCIB) and 
rs7761846 (ESRl), although these interactions did not 
achieve statistical significance - probably due to the low 
minor allele frequencies of rs741581 and rs6895698. 
However, the significant genetic interactions could not 
be detected in overall or ER-negative breast cancer 
(Additional file 2, Tables S6 and S7). 

Transcriptional regulation of PPARGCIB by ERa 

To understand the molecular mechanism underlying the 
observed genetic interaction, we investigated whether 
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Figure 1 Transcriptional regulation of PPARGC1B by estrogen receptor alpha in the IV1CF7 cell line (A) Schematic diagram of the genes 
from the UCSC database. (B) Map positions of six significant SNPs within PPARGCIB. (C) Recombination rate surrounding PPARGCIB from the 
HaplVlap CEU database. (D) MCF7 input DNA density for chromatin immunoprecipitation-sequencing (CHiP-seq) analysis. (E) CHIP-seq RNA Polll 
occupancy density without 1 7p-estradiol (E2) treatment (F) CHiP-seq estrogen receptor (ER) occupancy without E2 treatment. (G) CHIP-seq RNA 
Poill occupancy density with E2 treatment (H) CHIP-seq ER occupancy with E2 treatment. (I) ER interaction loop detected by chromatin 
interaction analysis by paired-end tag sequencing (ChlA-PET). 



Li ef al. Breast Cancer Research 201 1, 13:R10 
http://breast-cancer-research.eom/content/13/1/R10 



Page 8 of 12 



Table 4 Association analysis of rs7761846 within ESR1 



under a dominant model In ER-posltive case analysis 



Sample 


Control 


Case 


OR (95% CI) 


P value 


Swedish 


1,442 


675 


1.43 (1.10, 1.86) 


0.007 


Finnish 


1,246 


1669 


1.10 (0.81, 1.48) 


0.55 


Combined"' 


2,688 


2344 


1.28 (1.05, 1.56) 


0.014 



ER, estrogen receptor; OR, odds ratio; CI, confidence interval. ^Combined 
analysis was performed on logistic regression adjusted by age and study. 



there was any transcriptional cross-talk between the two 
genes beyond the known ligand-dependent, co-activating 
interaction of the PGC-ip with ERa [33,34], using the 
ER-responsive MCF7 breast cancer cell line. 

First, we examined the expression of PPARGCIB in 
MCF7 and noted a twofold induction of PPARGCIB 
expression by ER activation after estradiol administra- 
tion (Additional file 3 Figure SI). As a marker of 
transcriptional activity, chromatin immunoprecipitation- 
sequencing analysis in the same MCF7 cell line identified 
a significant peak of RNA polymerase II occupancy close 
to the transcriptional start site of PPARGCIB within the 
LD region of SNP association, and the RNA polymerase 
II occupancy was further enhanced by estradiol treat- 
ment. This observation confirms the transcriptional 
responsiveness of PPARGCIB to estradiol. Moreover, the 
chromatin immunoprecipitation-sequencing analysis also 
identified five ER binding sites in and around PPARGCIB 
(one site located approximately 50 kb 5' of the transcrip- 
tional start site, one in the second intron of the gene 
within 13 kb of the associated SNP rs741581, and the 
other three binding sites approximately 10, 31 and 57 kb 
3' of the polyadenylation signal sequence) and within the 
LD region of significant association with ER-positive 
breast cancer (Figure IF). Interestingly, the sites showing 
highest of ER occupancy were seen at two locations, one 



~13 kb from the significant SNP rs741581 and the 
second within 31 kb 3' of the polyadenylation signal 
sequence. 

We recently described the identification of all ER 
binding site interactions in the human genome [35,36] 
and defined that genes engaged in chromatin loop for- 
mation by a transcription factor were definitively regu- 
lated by the factor. Our data indicated that all of the ER 
binding sites around PPARGCIB were engaged in chro- 
matin loop formation centered on the PPARGCIB gene 
(Figure II), which indicates that ERa directly regulates 
PPARGCIB. 

Taken together, these data strongly indicate that 
PPARGCIB expression could be directly regulated by 
ERa and - when coupled with the known enhancement 
of ERa transcriptional activity by the PGC-ip at the site 
of binding - suggest a feed-forward regulatory loop 
between the two genes that augments ER signaling 
when the two factors are present. 

Discussion 

To our knowledge, this is the first comprehensive asso- 
ciation analysis of common variation within ER cofac- 
tor genes in breast cancer where 36 ER co-activators 
and 24 ER co-repressors were investigated. The utiliza- 
tion of two independent case-control samples of north- 
ern European origin allowed us to identify an 
association based not only on the overall significance 
in the large combined sample, but also on the consis- 
tency of the SNP association between the two indivi- 
dual samples. We found significant associations 
between PPARGCIB polymorphisms and risk for ER- 
positive breast cancer, and, importantly, we revealed a 
synergistic effect between the genetic polymorphisms 
within PPARGCIB and ESRl. 



Table 5 Pair-wise interaction between SNPs within PPARGCIB and ESff? on ER-positive breast cancer in combined 
Swedish and Finnish samples 



ESRl (rs7761846) PPARGCIB 





Cases (%) 


Controls (%) OR (95% CI) 


Cases (%) 


Controls (%) OR (95% CI) 






GG (rs2340621) 




GA/AA (rs2340621) 


TT 

cr/cc 

Interaction P value"' 


916 (39) 
106 (5) 
0.008 


1,121 (42) 1 

151 (6) 0.95 (0.71, 1.29) 


1,160 (50) 
161 (7) 


1,280 (48) 1.18(1.03,1.34) 
134 (5) 1.94 (1.47,2.55) 






GG (rs6895698) 




GA/AA (rs6895698) 


TT 

CT/CC 

Interaction P value^ 


1,284 (55) 
160 (7) 
0.506 


1,566 (58) 1 

184 (7) 121 (0.93, 1.55) 


791 (34) 
106 (5) 


837 (31) 1.28 (1.11,1.47) 
101 (4) 1.77 (1.30, 2.42) 






GG (rs741581) 




GA/AA (rs741581) 



TT 1,741(74) 2,076 (77) 1 335 (14) 326 (12) 141(1.17,1.70) 

CT/CC 225 (10) 251(9) 1.25(1.01,1.55) 42 (2) 34 (1) 218 (1.32,3.59) 

Interaction P value" 0459 



OR, odds ratio; CI, confidence interval. ^Analysis was performed on combined dataset, in which study and age were regarded as covariables. 
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Genetic association studies of ER cofactor genes have 
so far been limited. Burwinkel and colleagues reported a 
significant association of coding variants Q586 H and 
T960T of NCOA3 with familial breast cancer risk, and 
further suggested that familial breast cancer patients may 
condense the rare allele's contribution to the protective 
effect of breast cancer [12]. Whilst two studies have 
reported an association of the variant Pro241Pro in 
CCNDl with breast cancer risk [37,38], other studies 
have reported negative results for this variant [14,39,40]. 
In particular, Wirtenberger and colleagues investigated 
the coding variant Ala203Pro oi PPARGCIB and found it 
to be associated with familial breast cancer susceptibility 
[41]. In our study, we did not observe significant associa- 
tion between polymorphisms in NCOA3 and CCNDl 
with breast cancer risk. The Ala203Pro (rs7732671) var- 
iant of PPARGCIB, however, is 10 kb away and not cor- 
related with PPARGCIB SNP rs741581 (r^ < 0.05 in 
HapMap CEU data), and thus would not have been 
detected by our tagging SNP approach. Nevertheless, 
both Wirtenberger and colleagues' study and our study 
support the association of genetic variation of 
PPARGCIB with particular subtypes of breast cancer. 

Importantly, the association of PPARGCIB as well as 
its synergistic interaction with ESRl was only observed 
in breast cancer patients with ER-positive tumors, as 
would be expected according to the biochemical 
mechanism of interaction. There is growing evidence 
that the impact of genetic risk factors on breast cancer 
varies by hormone receptor status. For example, recent 
studies by the Breast Cancer Association Consortium 
have led to the discovery of novel breast cancer suscept- 
ibility loci in FGFR2, TNRC9, 8q24, 2q35, and 5pl2 that 
showed stronger association with ER-positive disease 
than with ER-negative disease [42-45], with fibroblast 
growth factor receptor also being a direct target of ER. 
These data suggest the risk of ER-positive tumors that 
has been shown to be driven by reproductive factors in 
epidemiologic studies also has a genomic basis based on 
the constituents of the ER gene regulatory network 
[46,47]. In our study, although the sample sizes of two 
ER-positive datasets were smaller compared with the 
two overall datasets, the number of overlapping SNPs 
between the Swedish and Finnish studies was thus larger 
than that observed in the overall breast cancer analysis. 
Recently, we also demonstrated that genetic variation of 
the estrogen metabolism pathway - particularly the 
genes involved in the production of estrogen through 
androgen conversion - also influences the risk for the 
development of estrogen-sensitive breast cancer [48]. As 
with this study, the effect size of the metabolism gene 
polymorphisms are relatively small but, taken together 
with PPARGCIB and fibroblast growth factor receptor, 
show that the estrogen receptor signaling axis that 



engages both upstream and downstream components 
may have, in the composite, a significant role in the 
genesis of the most common form of breast cancer. 

The genetic interaction between PPARGCIB and ESRl 
is biologically plausible. The PPARGCIB protein PGC- 
ip is a bona fide ER co-activator [34] that physically 
interacts with ERa and plays a role in amplifying ER sig- 
naling, which provides a convincing biological mechan- 
ism for the observed genetic interaction between the 
two genes. Furthermore, our series of transcriptional 
regulation analyses in the MCF7 ER-positive breast can- 
cer cell line has demonstrated that PPARGCIB expres- 
sion can be induced by estrogen treatment, and this 
transcriptional response of PPARGCIB is probably 
mediated by five functional ER binding sites around 
PPARGCIB that are all engaged in interlocking chroma- 
tin loops highly indicative of an ER regulated gene [35]. 
PPARGCIB may thus be involved in a feed-forward con- 
trol mechanism with ERa such that ER induction (for 
example, by estradiol treatment) heightens the expres- 
sion of a co-activator PPARGCIB of ER, which in turn 
increases ER action at the DNA binding site. The feed- 
forward looping mechanism will therefore further aug- 
ment the protein interaction between PPARGCIB and 
ESRl. This putative amplification effect, if confirmed, is 
another mechanistic model for epistatic interactions 
between genetic loci and may be one reason for the 
strength of its signal in the association study as com- 
pared with the other ER cofactors studied. 

There are some limitations to our study. Coverage of 
common variation is not sufficient (< 80%) for some 
genes (Additional file 1, Table SI), so that some associa- 
tions may have been missed. In addition, our tagging 
SNP selection provides a rather limited coverage of 5 kb 
surrounding sequences of the candidate genes, which 
may have contributed to some associations of regulatory 
SNPs being undetected, such as the one reported within 
ESRl [11]. The number of overlapping SNPs between 
the two datasets is small for both ER-positive and over- 
all breast cancer analyses. The limited overlapping could 
be due to ethnic heterogeneity between the two popula- 
tion samples and their moderate sample sizes. On the 
one hand, the ethnic heterogeneity may partially explain 
the low overlapping SNPs between two datasets; on the 
other hand, the current sample size is not large enough 
to capture the moderate effect of associated SNPs. Some 
of the top SNPs for each individual sample set are 
therefore probably false positive, which causes the small 
overlap between the numbers of significant SNPs in 
both datasets. The sample size limitation in ER-negative 
patients also could lead to the nonsignificant results in 
ER-negative analysis, since we observed that some asso- 
ciations in ER-negative analysis are in the same direc- 
tion with ER-positive analysis. ER cofactors are known 
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to work as a multicomponent protein complex, but due 
to a sample size limitation we are unable to detect inter- 
action among three or more genes simultaneously. It is 
also worth noting that the contribution of genetic varia- 
tion to cancer risk is based on both their prevalence and 
penetrance, and thus the relative importance of indivi- 
dual SNPs may vary from population to population. 
Further confirmation of our findings in other popula- 
tions is therefore warranted. 

Conclusions 

Our study has revealed an association of genetic variation 
within PPARGCIB with the risk of ER-positive breast can- 
cer. Consistent with the known interaction of PPARGCIB 
and ER at the molecular level, where PPARGCIB modu- 
lates ER activity and thus ER signaling, our study revealed 
a synergistic effect between genetic variation within the 
PPARGCIB and ESRl genes. PPARGCIB has been shown 
to alter responses to the selective ER modulator, tamoxifen 
[33]. Kressler and colleagues also demonstrated that 
PPARGCIB indirectly co-activates tamoxifen-bound ERa, 
which cooperates with NCOAl to enable tamoxifen agon- 
ism in kidney and osteosarcoma cell lines. Lastly, the 
synergism demonstrated in the present study also suggests 
that disrupting the interaction between an ER co-activator 
- such as PPARGCIB - and ERa, or blocking their mutual 
activation, may represent a sensitive and leveraged strategy 
for cancer prevention [7]. Our study therefore provides 
new biological insight into the genetic basis of the more 
common ER-positive breast cancer and highlights that bio- 
chemically and genomically informed candidate gene 
study can enhance the discovery of interactive disease sus- 
ceptibility genes. 

Additional material 



Additional file 3: Relative expression of PPARGCIB gene in MCF7 
cells 3 hrs post E2 treatment Figure SI presenting a relative 
expression of tine PPARGCIB gene in MCF7 cells 3 hours post 17p- 
estradiol treatment. 
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